Nonlinear Manifold Learning for Visual Speech Recognition

نویسندگان

  • Christoph Bregler
  • Stephen M. Omohundro
چکیده

A technique for representing and learning smooth nonlinear manifolds is presented and applied to sev­ eral lip reading tasks. Given a set of points drawn from a smooth manifold in an abstract feature space, the technique is capable of determining the structure of the surface and of finding the closest manifold point to a given query point. We use this technique to learn the "space of lips" in a visual speech recognition task. The learned manifold is used for tracking and extracting the lips, for interpolating between frames in an image se­ quence and for providing features for recognition. We describe a system based on Hidden Markov Models and this learned lip manifold that significantly improves the performance of acoustic speech recognizers in degraded environments. We also present preliminary results on a purely visual lip reader.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بهبود مدل تفکیک‌کننده منیفلدهای غیرخطی به‌منظور بازشناسی چهره با یک تصویر از هر فرد

Manifold learning is a dimension reduction method for extracting nonlinear structures of high-dimensional data. Many methods have been introduced for this purpose. Most of these methods usually extract a global manifold for data. However, in many real-world problems, there is not only one global manifold, but also additional information about the objects is shared by a large number of manifolds...

متن کامل

Using Locality Sensitive Hashing for Fast Computation of Correlational Manifold Learning based Robust Features

This paper considers the application of a random projections based hashing scheme, known as locality sensitive hashing (LSH), for fast computation of neighborhood graphs in manifold learning based feature space transformations in automatic speech recognition (ASR). Discriminative manifold learning based feature transformations have already been found to provide significant improvements in ASR p...

متن کامل

Face recognition based on manifold learning and Rényi entropy

Though manifold learning has been successfully applied in wide areas, such as data visualization, dimension reduction and speech recognition; few researches have been done with the combination of the information theory and the geometrical learning. In this paper, we carry out a bold exploration in this field, raise a new approach on face recognition, the intrinsic α-Rényi entropy of the face im...

متن کامل

A Geometric Perspective on Speech Sounds

In order to effectively approach high dimensional pattern recognition problems, one seeks to understand and exploit any inherent low dimensional structure. Recently, a number of manifold learning algorithms have been motivated by a geometric point of view that models high dimensional data as lying near a low dimensional submanifold of the original space. Our paper has two main goals: (i) to inv...

متن کامل

Nonlinear Image Interpolation using Manifold Learning

The problem of interpolating between specified images in an image sequence is a simple, but important task in model-based vision. We describe an approach based on the abstract task of "manifold learning" and present results on both synthetic and real image se­ quences. This problem arose in the development of a combined lip-reading and speech recognition system.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995